CDS

Accession Number TCMCG078C28445
gbkey CDS
Protein Id KAG0499206.1
Location join(71898266..71898559,71898663..71898719,71899175..71899357,71899496..71899580,71900219..71900309,71900384..71900554,71900774..71900893,71900974..71901049,71903880..71904051,71904143..71904912,71905249..71905326)
Organism Vanilla planifolia
locus_tag HPP92_003897

Protein

Length 698aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA633886, BioSample:SAMN14973820
db_source JADCNL010000001.1
Definition hypothetical protein HPP92_003897 [Vanilla planifolia]
Locus_tag HPP92_003897

EGGNOG-MAPPER Annotation

COG_category K
Description Auxin response factors (ARFs) are transcriptional factors that bind specifically to the DNA sequence 5'-TGTCTC-3' found in the auxin-responsive promoter elements (AuxREs)
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
KEGG_ko ko:K14486        [VIEW IN KEGG]
EC -
KEGG_Pathway ko04075        [VIEW IN KEGG]
map04075        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGGGGATCGACCTGAACATGATTGAGGAAGAGGGGGACGACCAGTCGATGGTGTGCCAGGCGGAGCAGAGGCCGTCGCCGGTATCGGCCGGAGGGGTATGCCTGGAGTTGTGGCATGCGTGTGCGGGGCCGCGGATTTCGCTTCCGAAGAAGGGGAGCTTGGTGGTTTATTTGCCACAGGGACATCTGGAGCATCTCACCTTCGCTAACGGTGGTGCGGGCGGCGGAAGGATGTTCCTCCGCCACGACTTGCCTCCCCACTTGGTCTGCCGAGTCGTCGACGTCCAGCTGCGCGCGGATGCTACCACCGATGAGGTCTATTCCCAGCTCTCCCTGCTTGCAGAGGGCGAGGTTTTTGACAAGCAGATTCAGGAAGGAATCTTGGAGAAAGTAGGAGAAGTTGATGAAATGGACAGTGGAGGAAAATCCGCTGTTCCTCACATGTTCTGTAAGACACTCACTGCCTCCGACACGAGCACTCATGGGGGCTTTTCTGTTCCATGCCGTGCGGCCGAGGACTGCTTTCCTCAGCTGGATTACAAGCAGCAGAGACCATCTCAAGAGCTTATTGCTAAAGATTTGCATGGTGTGGAATGGTGTTTTAAGCACATATATAGGGGTCAACCACGTAGGCATTTGCTCACAACAGGTTGGACTGGTTTTGTCAACAAGAAGAAGCTTACCCCAGGGGATGCTGTACTTTTTCTTCGGGGTGATGATGGAGAACTTAGATTGGGAATTCGTAGAGCATCTCAATTCATCGGTAGTATTTCTTTTACAATGCCTTCAAGTCAAGGGACAACTGTTGGAGCTGGTGTTCTAGTATCAATAGCAAATGCTGTATCCTCAAAGAGTACATTTCAAATCAATTACAACCCAAGGGCAAACCATTCGGAGTTCATTGTCCCTTATTGGAAGTTCGCAAAGAGCTGTAACCATTCAATTTCTGTTGGGGATCAGTTTAAAGTGCGAATTGAGAGCAAAGATACCACAGAGAGAAGGTATACTGGATTAGTAACTGCAGTTTGTGACTTGGATCCTCTGCGATGGCCACGGTCAAAGTGGCGATGTCTTTTGGTTAGGTGGGATGATAATGATGTCATTGAGAACTGTAGAAGACACAGCAGGCTCTCTCCATGGGAGATAGAACCTATAGGATCCTTCTCCAGCCACAACAATCTTATAGCTTCTGGTTCAAAAAGAAGCAGAAGTAGCTTTCCTTTAGGAAATGTTGATTCTCCACATCCAATTGGAAGTGGTTTTGAGAACTTGGGTGAATTCACAAGGTCCCCCAAGGTCTTGCAAGGTCAAGAAATTTTGGGTTTCAAGGCATCTTATAAAGAGGTTCCTAACAGCTGTTTGGCGCCTGATTTAAAACGCAGTGATTTTTTATACAAAGGTATAGGTCTTGGGGGATCGATTCGGTCCCATAAGGTCTTGCAAGGTCAAGAAATTGTTCCCGTACTACCTTCGTATCACACAATGGCTGTCGAAAGCAGAAGAGAGGCCGTCGGATTGAAAACATTTGAACCATCCATGCCCCTTCAGGGATACTGTTCCTTTGTTCCGTCGAGCTCCTCTTCGGCACAGGTCACGTCTCCTTCCTCGGTTCTGATGTTTGGGCAAGCTTCGTCACCCATGCCACAGCTTCATTCATCATACGGCCTAGAGGAAAGGGAGAAAGCTGGCAATGGCTGCTGCTTGTCCATTCAGTTTGGTTCAGCCGAGGCATGCGATTCGATGCAGAAACCCTCTTCTGGTCGGTGGATTTGGAGCCGCACGCAGAAGCCCTCTTTTGCTCGGCCAATTCGGAAGAACGGCCAGAATGTCTCCAATGGAGGTGGGGGAAGCGGTTTCAGGCTGTTTGGCTTTCCCCTCACTGAGAAGAGTACTGTTGCAAGTGTGGTAGATGGCTCTTTGGGTGAGGGTTCTCTAATGGAGAAAGGCATTGAATCATCTTTCTCAAACCAAAGGGCAGCAAACATGAGTACAAAGAATGCTGGACATGGATGTACAAGGGGTTTCTTGCAGCCAGGTTTCTCCACCTCCAGGAGCCTGTTTGATAGTAGTGCATCTGTTTCTGATGTGGATGAATGA
Protein:  
MGIDLNMIEEEGDDQSMVCQAEQRPSPVSAGGVCLELWHACAGPRISLPKKGSLVVYLPQGHLEHLTFANGGAGGGRMFLRHDLPPHLVCRVVDVQLRADATTDEVYSQLSLLAEGEVFDKQIQEGILEKVGEVDEMDSGGKSAVPHMFCKTLTASDTSTHGGFSVPCRAAEDCFPQLDYKQQRPSQELIAKDLHGVEWCFKHIYRGQPRRHLLTTGWTGFVNKKKLTPGDAVLFLRGDDGELRLGIRRASQFIGSISFTMPSSQGTTVGAGVLVSIANAVSSKSTFQINYNPRANHSEFIVPYWKFAKSCNHSISVGDQFKVRIESKDTTERRYTGLVTAVCDLDPLRWPRSKWRCLLVRWDDNDVIENCRRHSRLSPWEIEPIGSFSSHNNLIASGSKRSRSSFPLGNVDSPHPIGSGFENLGEFTRSPKVLQGQEILGFKASYKEVPNSCLAPDLKRSDFLYKGIGLGGSIRSHKVLQGQEIVPVLPSYHTMAVESRREAVGLKTFEPSMPLQGYCSFVPSSSSSAQVTSPSSVLMFGQASSPMPQLHSSYGLEEREKAGNGCCLSIQFGSAEACDSMQKPSSGRWIWSRTQKPSFARPIRKNGQNVSNGGGGSGFRLFGFPLTEKSTVASVVDGSLGEGSLMEKGIESSFSNQRAANMSTKNAGHGCTRGFLQPGFSTSRSLFDSSASVSDVDE